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Abstract We explore the degeneracy and discreteness problems in the standard cosmological 
model (ACDM). We use the Observational Hubble Data (OHD) and the type la supernova 
(SNe la) data to study this issue. In order to describe the discreteness in fitting of data, we 
define a factor Q to test the influence from each single data point and analyze the goodness 
of 0. Our results indicate that a higher absolute value of 0 shows a better capability of 
distinguishing models, which means the parameters are restricted into smaller confidence 
intervals with a larger figure of merit evaluation. Consequently, we claim that the factor 0 is 
an effective way in model differentiation when using different models to fit the observational 
data. 

Key words: cosmological parameters — cosmology: observations — methods: statistical 

1 INTRODUCTION 

The PLANCK (Planck Collaboration et al. 2014) satellite released its first results in 2013, which gave 
tighter constraints of cosmological parameter than before. Extensive observations have been made to con¬ 
strain cosmological parameters including OHD (Yi & Zhang 2007; Zhang et al. 2014; Ma & Zhang 2011; 
Moresco et al. 2012; Farooq et al. 2013; Farooq & Ratra 2013; Yuan & Zhang 2015), SNe la (Suzuki et al. 
2012; Perlmutter &. Schmidt 2003; Riess et al. 1998), cosmic microwave background radiation (CMBR) 
(Dunkley et al. 2009; Komatsu et al. 2011; Hinshaw et al. 2013; Planck Collaboration et al. 2014) and 
baryon acoustic oscillation (BAO) (Eisenstein et al. 2005; Percival et al. 2010) . Qualitatively, the con¬ 
straints imposed by more numerous observations can provide smaller confidence intervals of cosmological 
parameters. However, quantitative studies addressing how well the cosmological parameters are constrained 
if only limited datasets are available. In this paper, we present a new method of factor 0 to investigate this 
issue with OHD and SNe la Data and using the confidence interval and figure of merit (FoM) as inspection 
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2 METHODOLOGY 

2.1 Standard Cosmological Model (ACDM) 

We examine a standard non-flat ACDM model with a curvature term, Ok = 1 - Dm - Oa, and without a 
radiation term (Ma (& Zhang 2011; Farooq et al. 2013). Speciflcally the Hubble parameter is given by 

mz) = //o£(z;Q™,Qa,//o) 

= i/oV^^m(l+z)^+Gk(l+z)2 + QA. (1) 


The relationship between luminosity distance and redshift of SNe la is as below (Riess et al. 1998; Liu 


etal. 2011) 
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The distance modulus is given by following an empirical equation (Perlmutter et al. 1997): 

ju = 5 log Di- 5 log // -H 52.384. 


(3) 


(4) 


Combining Equation (3) and (4), we obtain the relationship between distance modulus and redshift, 
which is dependent upon cosmological parameters. 

Two datasets are utilized to constrain cosmological parameters, the existing 28 OHD (Zhang et al. 2014; 
Simon et al. 2005; Stern et al. 2010; Moresco et al. 2012; Busca et al. 2013; Blake et al. 2012; Chuang & 
Wang 2013) and the SNe la data provided by Supernova Cosmology Project (SCP) (Suzuki et al. 2012), 
which contains 580 type la supernovae with redshifts, distance modulus and errors. 


2.2 Degeneracy and Discreteness 

How the Hubble parameter and the distance modulus depends on redshift are shown in Figure 1 . An in¬ 
spection of Figure 1 suggests that in low-redshift regions, different models predict very similar distance 
modulus, i.e., they are degenerate. Therefore, the OHD and SNe la data in low-redshift regions cannot be 
used to distinguish these models. Here we use Figure 2 to show the observational error of OHD and SNe la 
dataset since error bars is not clearly shown in Figure 1. From figure 2, we And that the values observational 
errors are basically at the level of the red line, and in the region of OHD is from 0 to 30 meanwhile the 
region of SNe la is from 0.1 to 0.3. Besides, the Figure 2 does not shows a obvious relationship between 
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Based upon the likelihood function of single point, we can study the relationship between the goodness- 
of-fit and the final fitting results for a given model. With a set of data for fitting, the probability densities 
of each point show little difference in parameter space, suggesting that the probabilities of all parameters 
are approximately the same. We cannot distinguish the best fitting points, and meanwhile the confidence 
intervals are relatively large. In order to study the quality of the single points, we make a perturbation of 
one parameter to investigate how the likelihood of the single point varies. This can be seen as finding the 
absolute value of derivative of the likelihood. Likelihood depends on parameters (We give the equations in 
section 2.3), which indicates that we can calculate the derivative of likelihood with respected parameters. 

For the OHD dataset, we calculate the partial derivative of H{z\ Hq) with respect to different 

parameters. In this paper, we do not consider the goodness of confidence interval of Hq. 

dH ^ _ Hgzil+zf _ 

2 Vf2m(l + Zy + Qa + (1 - Qm - ^^a)(1 + 

dH -Hoziz + 2) 

- = - (o) 

2 + z? + Qa + (1 - Qm - ^aXI + zf 

For the SNe la dataset, we calculate the partial derivative of distance modulus with respected order 
parameters: n - //(z; Q^,, //q): 
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Noticing that in the above four functions flm + Qa + Ok = 1 is assumed still, and the definition of cosn(jii:) 
is similar to sinn(.r). 


2.3 Definition of Factor g 


cosn(.r) = < 


cosh.r 

X 

cos X 


Qk > 0 
Qk = 0 
Qk < 0 


In the next following equations, we introduce 0 to represent Q^ or Qa, and x represent the observational 
variables, fi or H. For the sake of simplicity, we use a subscript th to denote theoretical values, ob to denote 
observation values, i.e x,/, and Xoh- X. is the symbol of likelihood, i.e.. 
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^2 -W 

The posterior of a model is proportional to the product of the likelihood of each point: 
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Since the observational error is stochastic, randomly up and down to the difference between Xoi_, and 
Xffij. Moreover, only the gradient effect is focused on in our study. Hence we set Xg^j - , - cr - where 

& is the average of observational errors of all data. Under the same condition, we can define the discreteness 
factor Q, that is proportional to ^ ^: 


@(z;e|x) = -^ (13) 

cr oB 

The factor Q depends on the redshifts and the confidence intervals of the data. Small confidence intervals 
are required to distinguish models if the models tend to be degenerate. In the parameter regions where 
the models appear to be discrete, we do not need small confidence intervals. The Q value can be used to 
quantitatively measure the discreteness of data points with errors at different redshifts. 

For a fitting process, if a new observational data point is added, it will take such an effect to the result: 
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Here we use - In X(z, 6) instead of X(z, 6) to simplify formulas. It is not so important to find an explicit 
equation of likelihood. Now we have a criterion to measure the discreteness of different data points caused 
by the model itself In next section, we will apply our method to the existing 28 OHD and 580 SNe la data 
given by SCP to examine the power of the factor Q. 


3 ANALYSIS AND RESULTS 

One of the results that we focus on is the relationship between the redshift z and the factor Q. From the 
definition of the factor Q, we find it is also related to the parameters we select, which means, for different 
parameters at the same redshift, the goodness of discreteness of the data points is different. 

Nowadays constraints on cosmological parameters give the value of fim - 0-32 and Q.\ ^ 0.68 (Planck 
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Dataset 
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SNe la 


all 
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0.275±0.070 

0.712±0.117 

387.18 

SNe la 

Om 

high 

500 

0.287±0.071 

0.751 ±0.126 

353.54 

SNe la 


high 

400 

0.255±0.086 

0.657±0.187 

206.13 

SNe la 


mid 

500 

0.320±0.120 

0.788±0.168 

200.93 

SNe la 


mid 

400 

0.279±0.179 

0.784±0.223 

109.64 

SNe la 
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0.222±0.152 

0.672±0.187 

142.60 

SNe la 

^in 

low 

400 

-1.097±0.357 

0.342±0.353 


SNe la 

Qa 
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0.281±0.072 

0.759±0.727 

353.54 

SNe la 

Qa 
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400 

0.270±0.081 

0.706±0.169 

238.91 

SNe la 

Qa 

mid 
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0.291±0.079 

0.778±0.132 

303.60 

SNe la 

Qa 
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0.810±0.147 
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SNe la 

Qa 

low 

500 

0.307±0.087 

0.743±0.137 

244.70 

SNe la 

Qa 

low 

400 

0.245±0.139 

0.780±0.189 

126.50 

OHD 


all 

28 

0.279±0.078 

0.637±0.260 

128.63 

OHD 


high 

22 

0.280±0.082 

0.643±0.279 

125.11 

OHD 

f2|n 

high 

16 

0.271±0.097 

0.438±0.416 

78.18 

OHD 

f2|n 

mid 

22 

0.870±0.220 

1.480±0.369 


OHD 

f2|n 

mid 

16 

0.779±0.409 

1.230±0.661 


OHD 
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22 

0.740±0.391 

1.220±0.610 


OHD 
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16 




OHD 

Qa 

high 

22 

0.293±0.072 

0.766±0.235 

150.92 

OHD 

Qa 

high 

16 

-0.202± 1.569 

-0.181±2.072 

100.72 

OHD 

Qa 

mid 

22 

0.258±0.091 

0.497±0.422 

79.31 

OHD 

Qa 

mid 

16 




OHD 

Qa 

low 

22 




OHD 

Qa 

low 

16 





Table 1 The fitting results by reordering all data depending on Q. In column and Q./^\ (1) 
Emphasized text means reasonable in ACDM model. (2) Normal text means unreasonable in 
ACDM model. (3) Blanks of (Qm, densl) means no constraint results due to non-convergence. 
(4) Results of (Qjn, densl) in bold text and result in the first row of the table are shown in figure 
4. 


by adding more datasets. After parameters confirmed, the factor ^ will be a function only relying on the 
redshift and observational error. 

Figure 3 shows how factor Q changes with redshift z. Here, for the theoretical value of factor we 
assume that the all standard deviations errors are equal to the average standard deviation error. For obser¬ 
vational value of factor 0, we take observational standard errors. Since the key point is the degree that 
likelihood changes with the cosmological parameter, it is sufficient to focus on the absolute value \Q\. 

From Figure 3, we find that |G(z; Qm|x)| and |G(z;Qa|//)| monotonically increase with redshift, while 
|G(z;Qa|//)| does not, which indicates that for the constraint on Qm, high redshift data show the obvious 
advantage. However, this is not the case for Qa, high redshift datasets of SNe la do not show this kind of 
advantage. Considering the true standard deviation error of each data point. Figure 3 also indicates the true 
value of discreteness. Both true and theoretical deviations error indicate the same tendency, but due to the 
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some data points are at higher redshifts, the factor Q of these data points may be smaller than ^s of lower 
redshift data points. 

Next, we reorder data depending on the absolute value of factor Q of and Qa for both OHD and 
SNe la data. To make comparison, we take different factor Q of different parameters. For OHD, we take 
28, 22, and 16 out of 28 data and while for SNe la, we take 580, 500 and 400 out of 580 data. When not 
all data are selected, we linearly divide data into three levels, the high, the mid and the low ^s of all data. 
Then we employ MCMC method to resample the best-fitting point and explore the changes of the best¬ 
fitting point and confidence interval. We use the publicly available code PyMC (https://github.com/pymc- 
devs/pymc) to perform a full MCMC analysis. The results are listed in Table 1. The results close to PLANCK 
(Planck Collaboration et al. 2014) and WMAP (Komatsu et al. 2011) can be considered reasonable and the 
reasonable results are also close to results using SNe la data and OHD and given by Komatsu et al. (201 1); 
Ma & Zhang (2011); Moresco et al. (2012); Wang et al. (2012); Farooq et al. (2013); Zhang et al. (2014). 

Table 1 indicates that if we select same level of data, for both Qa and Q^, the confidence intervals are 
generally increasing with the quantities of datasets decreasing, suggesting that our factor Q is effective in 
distinguishing the data with discreteness. However, we find that in some cases the Markov Chains of this 
group of data does not converge, or the fitting yields abnormal results. 

Theories of probability and statistics indicate that that the goodness-of-fit increases with the amount of 
observational data. However, in our experiment, we find that the confidence intervals may become smaller 
when we remove data of lower factor Q. There are two ways to datasets’ ability to tigthen the constrains, 
directly checking the standard deviation error of the constrained parameter to examine the constraint on the 
specified parameter and establishing a quantified figure of merit (FoM) to examine the comprehensive effect 
on both parameters. The FoM can be defined as long as it reasonably rewards a tight fit while punishing a 
loose one. We apply the definition of 0.95 confidence region in a parameter space (Albrecht et al. 2006), 
which can be calculated by: 


n f 

FoM^y = - = - 

A cr(0^)cr(0y) - p^y 


(16) 


Here is correlation coefficient between 0^ and 0y, with the relation to covariance matrix = 
cr{0^)cr{0y)pxy. The larger FoM is, the better constraint we get. 

Seeing that in the OHD dataset, we have cr - 0.235 of 22 higher part, smaller than cr = 0.260, if we set 
Qa as the parameter of Q, and in this case, we consider that OHD at z = 0.48, z = 0.88 and z = 1.75 did not 
take positive effect altogether in fitting. Therefore we may pick all these negative points out theoretically. 
Besides, we notice that the of 22 higher part is larger than the one of all data, which means the 

declination of the points even improve the total constraint quality. 

Moreover, we should notice that the standard deviation error of Q^ and Qa contains a correlation due 
to the same cr"'. Here, a point of larger discreteness of Qm may show a relatively larger discreteness of Qa. 

We should mention that the Q factor applies for one parameter in one function, and the effect brought by 
data removal has been ignored. It can be considered that the factor Q we defined reflects the quality of the 




Discreteness and degeneracy in cosmological fitting 


7 


do not have or positive effect on the fitted cosmological model. Furthermore, we deal with these removed 
points in accordance with specific conditions. 

4 CONCLUSION AND DISCUSSION 

In this paper, we develop a new method to study the degeneracy and discreteness in cosmological model. 
We define a criterion called discreteness factor Q that relates the modeling functions, likelihood and un¬ 
determined parameter. The definition of the factor Q is independent of any specific modeling functions, 
hence it can be generalized to an arbitrary modeling process. We start from non-flat ACDM model based on 
the existing 28 observational Hubble data (OHD)(Zhang et al. 2014; Simon et al. 2005; Stern et al. 2010; 
Moresco et al. 2012; Busca et al. 2013; Blake et al. 2012; Chuang & Wang 2013) and 580 type la supernova 
(SNe la) data released by Supernova Cosmology Project (SCP)(Suzuki et al. 2012). The functions indicate 
that theoretically factor Q of increases with redshift, however, due to the different observational standard 
deviations errors for all data, true value of ^s only shows the trend, especially for the OHD dataset. 

We compute the factor Q for Qm and Qa and reorder the data utilizing true value of ^s in both OHD and 
SNe la datasets. We generate Markov Chain Monte Carlo (MCMC) to find the best-fitting points and their 
confidence intervals. The fitting results demonstrate that the Q takes effect on fitting, and higher absolute 
value of 0 gives a stronger constraint and a larger FoM evaluation. Besides, data of lower Q values may 
provide not only larger confidence interval but also unreasonable best-fitting point. But the factor Q displays 
its limitation in some aspect. According to theory of statistics and probability, the confidence interval ex¬ 
plains that some data are removed usually. However, as if the effect of Q is strong enough, it will cover the 
intrinsic properties of statistics and probability. Once we find the intervals decrease or FoM increases with 
fewer data of lower Q value, we may find observational data hardly taking positive effects in constraint. 
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OHD Dataset 



SNe la Dataset 



Figure 1 Theoretical and observational Hubble parameter and the distance modulus of SNe la. 
The upper sub-figure shows the OHD dataset with Icr confidence interval and the theoretical 
//(z; Qni, QA,i/o) value of different models. The lower one indicates the SNe la datasets with 
Icr confidence interval and the theoretical curves predicted by different models. The red, blue 
and green curves represent the model predictions PLANCK (Planck Collaboration et al. 2014), 
WMAP (Komatsu et al. 2011), and the model of full matter, respectively. Black dots and error 
bars indicates observational data. 
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Redshift z Number 


Figure 2 The observational error of SNe la dataset given by SCR The top sub-figures illustrate 
the OHD dataset and the bottom sub-figures iluustrate the SNe la datasets. Besides, the left sub¬ 
figures show the two dimensional diagrams between observational error and redshift, and the 
red lines are the average values of all errors. The right sub-figure shows the histogram of the 
distribution of observational errors (OHD in 10 bins and SNe la in 12 bins). 
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Figure 3 Theoretical and observational factor 0 of OHD set and SNe la dataset. In these two 
sub-figures, blue dashed and dotted-dashed curves represent the theoretically predictions. We 
take OHD set and SNe la dataset into Equation 13 to calculate factor G(z;Qm|//), 

G{z\ OaIH) and 0{z\ OaI/t) of each point, then we plot these ^s as dots in the figure. The verti¬ 
cal black dotted line indicates the point of highest redshift and the horizontal black dotted line 
denotes Q -0. 
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Figure 4 Typical confidence regions in the (Qa, parameter subspace. The solid, dashed and 
dotted contours respectively correspond to 68.3%, 95.4% and 99.7% intervals. The black dot- 
dashed line indicates a flat universe. The interval of red contour is slightly smaller than blue 
contour which is marked bold in Table 1. 
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